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(54) Coprocessor for synthesizing signals based upon quadratic polynomial sinusoids 



(57) A coprocessor (15) for synthesizing a signal 
from the sum of sin usoids is disclosed. The coprocessor 
(15) is preferably included within an electronic system 
having a host processor (1 2) that forwards frame bound- 
ary parameters to the coprocessor (15) . Parameter reg- 
isters (26) are provided in the coprocessor (15) for stor- 
ing synthesis parameters for iteratively deriving ampli- 
tude and phase values for each sample point within a 
data frame. Adders (28, 30, 32) generate the current 
amplitude from one addition, and the current phase val- 
ue from two additions, with the results of the additions 
stored back into the parameter registers (26). A sine 
function calculator circuit (34), such as may be imple- 
mented by way of a CORDIC technique, receives the 
current amplitude and phase values, and generate a 
digital component signal for the current sample point for 
one of the sinusoids. The digital component signal is ac- 
cumulated with that of other sinusoids at the sample 
point in a data sample buffer (40), with the final accu- 
mulated digital components presented at an output (44) 
to the host processor (12). 
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Description 

BACKGROUND OF THE INVENTION 

[0001 ] This invention is in the field of integrated circuit 
architecture, and is more specifically directed to an ar- 
chitecture for a logic integrated circuit for synthesizing 
signals. 

[0002] Many modern electronics systems utilize syn- 
thesized analog signals to carry out their functions. 
These synthesized signals are typically generated by 
circuits that are referred to in the art as frequency syn- 
thesizers. For example, communications systems utilize 
frequency synthesizers to generate carrier and modu- 
lated signals to be transmitted, as well as in the demod- 
ulation of received communications signals. Filtertuning 
circuits in communications systems also utilize signals 
generated by frequency synthesizers. In recent years, 
so-called "frequency agile" communications systems 
have been developed, examples of which include 
spread spectrum local area networks (LANs), frequency 
hopped systems, and Code Division Multiple Access 
(CDMA) cellular telephone systems; each of these fre- 
quency agile systems also rely upon signals generated 
by frequency synthesizer circuits. 
[0003] The rapid increase in the computing powerthat 
may be realized in an integrated circuit, according to 
modern technology, has expanded the ability of elec- 
tronic systems to process multimedia information, in- 
cluding audio signals. Modern multimedia electronic 
systems, and multimedia functions embodied within 
computer systems, use frequency synthesizers to gen- 
erate speech and music audio signals that are modeled 
as a sum of sinusoids, consistent with the Fourier theo- 
rem. For accurate synthesis of speech and audio sig- 
nals, the frequency synthesizers in the multimedia sys- 
tems are called upon to generate thousands of sinu- 
soids at selected frequencies. For music synthesis, this 
sum of sinusoids may be combined with a stochastic 
component that models the "noisy" component of musi- 
cal sounds, to produce an overall realistic sound; this 
stochastic component corresponds to effects such as 
breath noises in wind instruments, bowing noise in 
bowed stringed instruments, and variations in the sound 
of different pianos (e.g., grand versus upright). 
[0004] Figure la illustrates a series of sinusoidal sig- 
nal components to be generated by frequency synthe- 
sizer circuitry in complex electronic multimedia and 
communications systems. For purposes of digital signal 
processing, the time basis of these signals is generally 
subdivided into a series of frames, each frame contain- 
ing a number of sample points. The complete synthe- 
sized signal s n (t) for the n th frame corresponds to the 
sum of M components, according to the relationship: 



s n (t) = |]s^(t) 

m=l 

5 

In the communications context, this complete signal s 
(t), over all frames, corresponds to a single channel; 
multiple channels, each with a synthesized signal s(t), 
are then multiplexed in the conventional fashion to effect 

10 multi-channel communication. 

[0005] According to conventional signal synthesis 
techniques, each signal component within a single 
frame is subject to certain constraints. Atypical example 
of such constraints, expressed for signal s£ (t) of the m th 

*5 component in the n th frame, is: 

s^(t) = A^(t)sin{e^(t)} 

20 

where A£ (t) is the instantaneous amplitude of the signal 
component, and where 8^ (t) is its instantaneous phase. 
As evident from this expression, both the amplitude and 
phase may vary overtime within each frame of the signal 

25 component according to specified time functions. Figure 
1 b illustrates an enlarged view of signal s£ (t) of the m th 
component in the n th frame of Figure 1 a (bounded within 
box 1 0). The time-dependent amplitude A£ (t) within this 
n th frame in this example is shown as a linearly increas- 

30 jng function with time, by way of example. Typically, cer- 
tain constraints specifying the time-dependence of am- 
plitude and phase are specified in the synthesis, such 
as the order (constant, linear, quadratic, etc.) with which 
the amplitude and phase may vary within a frame. Fur- 

35 thermore, while the particular time function for amplitude 
and phase variation may vary (and should vary, in order 
to communicate meaningful information) from frame to 
frame, the component signals s£(t) are required to be 
continuous across frame boundaries. 

40 [0006] From the expression: 

s^(t) = A^(t)sin{e^(t)| 

45 

it can be readily seen that the signal synthesis process 
according to this approach requires three steps, namely 
instantaneous phase calculation, sinusoidal function 
calculation, and sinusoidal amplitude weighting. Logic 
50 circuit architectures for synthesizing signals in this man- 
ner are known in the art. One architecture, described in 
Houghton, et al.,"An ASIC for digital additive sine- wave 
synthesis", Computer Music Journal, Vol. 19, No. 3 
(1 995), pp. 26-31 , effectively integrates the instantane- 
55 ous frequency of the signal component to derive the in- 
stantaneous phase, uses a table lookup for performing 
the sinusoidal calculation, and implements the sinusoi- 
dal amplitude by a multiplier. Another architecture is de- 
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scribed in Bernardinis, et aL "A single chip 1 ,200 sinu- 
soid real-time generator for additive synthesis of musi- 
cal signals", Proc. of the International Conference on 
Acoustics, Speech, and Signal Processing (IEEE, 
1 997), pp. 427-30, according to which a marginally sta- s 
ble second order filter-based oscillator bank generates 
the sinusoids, and a systolic array of multipliers apply 
the correct amplitudes to these sinusoids. 
[0007] Each of these conventional architectures is 
limited in the accuracy with which the resultant signal is 10 
synthesized, primarily due to the limited accuracy with 
which the phases of the synthesized sinusoids are ob- 
tained. The Houghton, et af. approach, in which sinusoid 
phases are generated by integrating instantaneous fre- 
quency, completely ignores phase information and is is 
thus inherently limited in its accuracy. It is believed that 
the Bernardinis, et al. architecture may produce audible 
artifacts in certain situations, for example in the case of 
steep frequency variation. Furthermore, it is believed 
that neither of these known architectures addresses the 20 
need for post-processing the synthesized sinusoids, fur- 
ther limiting their applicability. 

BRIEF SUMMARY OF THE INVENTION 

25 

[0008] It is therefore an object of the present invention 
to provide a logic architecture for efficiently synthesizing 
multicomponent sinusoidal signals. 
[0009] It is a further object of the present invention to 
provide such an architecture that is suitable for imple- 30 
mentation as a coprocessor. 

[001 0] It is a further object of the present invention to 
provide such an architecture for a coprocessor in which 
the number of sinusoid components that may be syn- 
thesized in real time is limited by the data transfer band- 35 
width between the host processor and the coprocessor, 
rather than by the sinusoidal computational rate. 
[001 1] It is a further object of the present invention to 
provide such an architecture in which multi-channel sig- 
nals may be easily and efficiently generated. 40 
[0012] Other objects and advantages of the present 
invention will be apparent to those of ordinary skill in the 
art having reference to the following specification to- 
gether with its drawings. 

[001 3] According to the present invention there is pro- <*s 
vided a coprocessor for synthesis of a digital signal cor- 
responding to a sum of sinusoids, comprising: 

a plurality of parameter registers for storing param- 
eters corresponding to a previous sample point in a so 
frame, the plurality of parameter registers compris- 
ing an amplitude register, a delta amplitude register, 
a phase register, a difference register, and a delta 
difference register; 

a first adderf or adding the contents of the amplitude ss 
register and the delta amplitude register to produce 
a current amplitude value for a current sample point 
in the frame, the first adder having an output cou- 



pled to the amplitude register to update the contents 
thereof; 

a second adder for adding the contents of the dif- 
ference register and the delta difference register, 
the second adder having an output coupled to the 
difference register to update the contents thereof; 
a third adder having inputs coupled to the output of 
the second adder and to the phase register, for add- 
ing at least a portion of the output of the second 
adder to the contents of the phase register to pro- 
duce a current phase value for the current sample 
point in the frame, the third adder having an output 
coupled to the phase register to update the contents 
thereof; 

sine function calculator circuitry, having inputs cou- 
pled to the output of the first adder and to the output 
of the third adder, for generating a digital value cor- 
responding to a sine of the current phase value at 
the current amplitude value; and 
accumulator circuitry, for adding the digital value at 
the output of the sine function calculator circuitry to 
previously calculated digital values for the current 
sample point in the frame. 

[001 4] According to the present invention there is also 
provided a method of synthesizing a digital signal cor- 
responding to a sum of a plurality of sinusoidal compo- 
nents, comprising the steps of: 

for a first frame, corresponding to a period of time 
including a first plurality of sample points, receiving 
frame boundary parameters for each of a plurality 
of sinusoidal components; 

for a first sinusoidal component, initializing the con- 
tents of a plurality of parameter registers corre- 
sponding to a previous sample point in a frame, the 
plurality of parameter registers storing an amplitude 
value, a delta amplitude value, a phase value, a dif- 
ference value, and a delta difference value: 
for each of a plurality of sample points in the frame 
for the first sinusoidal component, iteratively gener- 
ating a sinusoidal sample value having an ampli- 
tude corresponding to a sum of the amplitude value 
and the delta amplitude value, times the sine of a 
phase angle corresponding to a sum of a phase val- 
ue with a sum of the difference value and delta dif- 
ference value; 

storing the sinusoidal sample values for the first si- 
nusoidal component in a buffer; 
repeating the initialization, generating, and storing 
steps for each of a plurality of sinusoidal compo- 
nents, wherein the storing step accumulates the si- 
nusoidal sample values for each of the plurality of 
sample points, over the plurality of sinusoidal com- 
ponents; and 

after the initialization, generating, and storing steps 
for the plurality of sinusoidal components, output- 
ting the accumulated sinusoidal sample values. 
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BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF 
THE DRAWING 

[0015] Figure 1 a is a timing diagram illustrating mul- 
tiple sinusoidal components of a signal that may be syn- 
thesized according to both conventional techniques and 
also according to the preferred embodiment of the in- 
vention. 

[0016] Figure 1b is a timing diagram illustrating one 
frame of one of the component signals shown in Figure 
1a. 

[001 7] Figure 2 is an electrical diagram, in block form, 
of an electronic system constructed according to the 
preferred embodiment of the invention. 
[001 8] Figure 3 is an electrical diagram, in block form, 
of a coprocessor in the electronic system of Figure 2, 
constructed according to the preferred embodiment of 
the invention. 

[0019] Figure 4 is an electrical diagram, in block form, 
of a sine function calculator in the coprocessor of Figure 
3 according to the preferred embodiment of the inven- 
tion. 

[0020] Figure 5 is a memory map of the data sample 
buffer in the coprocessor of Figure 3 according to the 
preferred embodiment of the invention. 
[0021] Figure 6 is a flow diagram illustrating the oper- 
ation of the coprocessor of Figure 3 according to the pre- 
ferred embodiment of the invention. 
[0022] Figure 7 is a flow diagram illustrating the gen- 
eration of sinusoidal component values in the operation 
of the coprocessor illustrated in Figure 6, according to 
the preferred embodiment of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[0023] The present invention may be utilized in con- 
nection with any one of a number of electronic devices 
and systems, particularly those in which the synthesis 
of signals based upon the sum of sinusoids is useful. 
Examples of such systems include radar systems using 
pulse compression techniques, and frequency-agile 
communications systems such as spread spectrum lo- 
cal area networks (LANs), frequency hopped systems, 
and Code Division Multiple Access (CDMA) cellular tel- 
ephone systems. The present invention is contemplated 
to be particularly beneficial when used in multimedia 
systems in which signal synthesis is used in connection 
with music and speech audio applications. Given the 
wide range of applications of the present invention, the 
preferred embodiment of the present invention will be 
described in connection with a somewhat generic elec- 
tronic system, it being understood that those skilled in 
the art having reference to this description will be readily 
able to implement the present invention in connection 
with the above-noted, and other, applications for which 
signal synthesis is useful. 

[0024] Figure 2 illustrates exemplary electronic sys- 
tem 20, into which the preferred embodiment of the 



present invention may be implemented. System 20 in- 
cludes host processor 1 2, which is preferably a high per- 
formance digital signal processor (DSP), or general-pur- 
pose microprocessor. For example, particularly in the 

5 context of multimedia electronic systems, it is contem- 
plated that host processor 1 2 may be a high or medium 
performance DSP, such as a member of the 
TMS320C6000 or TMS320C5000 DSP families : re- 
spectively, manufactured by Texas Instruments Incorpo- 

10 rated. In this regard, the processing capacity of host 
processor 12 in system 20 according to this preferred 
embodiment of the present invention is contemplated to 
be substantial, consistent with modern central process- 
ing units (CPUs) that are currently available. 

15 [0025] In this regard, the architecture of exemplary 
system 20 surrounding host processor 1 2 is contemplat- 
ed to be similar to that of many CPU-based systems. In 
this regard, host processor 12 is coupled to memory sys- 
tem 14 by way of memory bus MBUS. Memory system 

20 14 includes random access memory (RAM) for data and 
program storage; additionally, if appropriate for the par- 
ticular application, memory system 14 may include 
read-only memory (ROM) for boot or program storage. 
As shown in Figure 2, cache memory 13 may also be 

25 coupled to host processor 12 via memory bus MBUS, 
for high performance operation. Host processor 12 also 
interfaces with system bus SBUS, by way of which com- 
munication with various input/output devices 18, includ- 
ing disk memory 1 6. may be carried out. Of course, the 

30 presence or absence of disk memory 16, as well as the 
number and nature of the various input/output devices 
18, will depend upon the particular application. 
[0026] According to the preferred embodiment of the 
invention, host processor 12 is also bidirectionally cou- 

35 pled to sinusoid coprocessor 15. Sinusoid coprocessor 
1 5 includes logic for deriving digital data corresponding 
to the sum of sinusoidal components, as will be de- 
scribed in detail hereinbelow. In similar manner as co- 
processors generally, sinusoid coprocessor 15 receives 

40 operands from host processor 12 in combination with 
control signals corresponding to a coprocessor instruc- 
tion, and executes the corresponding routine. Upon 
completion, sinusoid coprocessor 1 5 applies the results 
of the coprocessor routine, in the form of operands or 

45 other data, to post-processing circuitry 17. Post- 
processing circuitry 1 7, in this embodiment of the inven- 
tion, includes circuit functions for using or processing 
synthesized digital signals generated by sinusoid proc- 
essor 15; by way of example, post-processing circuitry 

so 1 7 may include digital filters, effects processors, digital- 
to-analog converters, and the like, individually or in com- 
bination with one another. The output of post-processing 
circuitry 17 in this exemplary realization is shown in Fig- 
ure 2 as an analog signal presented on line ANALOG 

55 OUT. Alternatively, sinusoid coprocessor 15 may for- 
ward the results of the coprocessor routine to host proc- 
essor 12, for additional signal processing thereby. 
[0027] The extent to which system 20, and particularly 
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host processor 12 and coprocessor 15, are physically 
integrated, will depend upon the particular technology 
used as well as the desired application. For example, 
sinusoid coprocessor 15 a separate integrated circuit 
from host processor 12; alternatively, as known in the 5 
art sinusoid coprocessor 15 may be implemented into 
the same integrated circuit chip as host processor 12. 
Additionally, cache memory 13 may be partially or fully 
integrated with host processor 1 2, as is known in the art. 
In any event, the buses illustrated in Figure 2 for system 10 
20 are contemplated to correspond to the interconnec- 
tion of the functional blocks, whether realized within one 
or more integrated circuit devices. 
[0028] According to this preferred embodiment of the 
invention, sinusoid coprocessor 15 generates a digital *5 
signal that corresponds to the sum of multiple sinusoidal 
components. As described above, a model for such a 
synthesized signal s n (t), for the n th frame in a series of 
frames, corresponds to a sum of M components: 

20 
25 

where each component signal s£ (t) is expressed for the 
m th component in the n th frame by: 

s^(t) = A^(t)sin{9^(t)} so 



A£(t) is the instantaneous amplitude of the signal com- 
ponent, and 9£ (t) is its instantaneous phase. According 
to the preferred embodiment of the invention, the instan- 35 
taneous amplitude and phase relationships are estimat- 
ed as linear combinations of linear and quadratic spline 
functions, as follows: 

40 

A:(t)=A> n (t) + A^ 1 A n+1 (t) 
e^(t)=(o m t + a^ 2 B n . 2 (t) + a^ 1 (tJ+aJXct) 

45 

In these expressions, A n (t) and B n (t) are linear and 
quadratic B-splines, respectively, and co m is the nominal 
frequency of this m th component. 
[0029] Also as noted above, each component signal 
s£(t) is continuous across frame boundaries, which so 
places useful constraints upon the modeling of each 
component signal s£ (t). According to this preferred em- 
bodiment of the invention, one may algorithmically cal- 
culate a component signal s£ (t) within each frame given 
the values of frame boundary conditions: This approach 55 
to the calculation of component signal s n (t) is 



{A-.o--'^-- 1 } and {A^\cC} 

This approach to the calculation of component signal s£ 
(t) is described in Ding, et al., "Processing of Musical 
Tones Using a Combined Quadratic Polynomial-Phase 
Sinusoid and Residual (QUASAR) Signal Model", J. Au- 
dio Eng. Soc, Vol. 45, No. 7/8 (July/August 1997), pp. 
571 -584, incorporated herein by this reference. For pur- 
poses of clarity of this description, the theory of opera- 
tion of this algorithm will now be described. 
[0030] Consider each data frame to consist of L sam- 
ples, such that time variable t=nL+k, where k is the index 
of the samples such that k = 0, 1 , L-1 . One may re- 
cursively calculate a first order polynomial by one addi- 
tion, and a second order polynomial by two additions. 
As noted above, the instantaneous amplitude A£(t) is 
expressed in connection with a first order B-spline. Ac- 
cordingly, the instantaneous amplitude for the k th sam- 
ple in the n th frame for the m th sinusoid component may 
be expressed by the sum: 

A^(nL + k)=A^(nL + k-1) + AA^ 

As will be described below, the difference operand AA^ 
is derived from the frame boundary constraints that 
render the sinusoidal component continuous across 
frames. 

[0031] Similarly, because the instantaneous phase 
8£(t) is based upon quadratic B-splines, two additions 
are required for its recursive calculation. According to 
the theory of operation of the preferred embodiment of 
the present invention, these two additions are per- 
formed, for the m th component, by the combination of 
an intermediate difference value x n (k) : 

x:(k)=x>-i) + Ax: 

with an addition to generate the instantaneous sample 
phase G£ (nL+k) as follows: 

e>L+k)=e>L+k-1) +X ;>) 

The difference value Ax" is also derived from the inter- 
na 

frame boundary conditions, as will now be described. 
[0032] As noted above, the frame boundary condi- 
tions set the values of A" and A™" 1 , and of oc n " 2 , a 11 " 1 , 

m m m 1 m ' 

and a£. These boundary conditions set the initial con- 
ditions for the difference values AA" and x" (k), as well 
as other initial conditions. In this regard, the initial value 
of difference value AA£ may be derived as: 
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L 



For the second-order determination of instantaneous 
phase 0£(nl_+k), the following initial conditions are de- 
rived: 



, n-1 n-2 x 

e>L)=<o m nL + <^^J 



n+1 



s n n-1. 
(«m " «m ) 



* n+1 n v 
A n («>m - M m) 
Ax m = [ 



[0033] Beginning from these initial conditions for the 
frame, one may thus derive the values of instantaneous 
amplitude A£(nL+k) and instantaneous phase 6£ 
(nL+k) for the k th sample in the n^ frame of the m th com- 
ponent of the signal. These values of instantaneous am- 
plitude and instantaneous phase may then be combined 
to form the k th sample of the m th sinusoid component in 
frame n as follows: 



s" (nL + k) = (nL + k) ■ sin^ (nL + k)} 



The m sinusoid components are then summed to form 
the overall signal s n (nL+k) at each sample point in frame 
n. 

[0034] Referring now to Figure 3, the construction of 
sinusoid coprocessor 1 5 according to the preferred em- 
bodiment of the invention will now be described in detail. 
While this exemplary architecture is particularly suited 
for and efficient in carrying out signal synthesis accord- 
ing to the present invention, it is of course contemplated 
that variations in this architecture may also be apparent 
to those skilled in the art having reference to this spec- 
ification. Such variations and alternative architectures 
are contemplated to be within the scope of the present 
invention. 

[0035] As shown in Figure 3 , sinusoid coprocessor 1 5 
includes data register and logic 22 for receiving data op- 
erands from host processor 12. These operands corre- 
spond to the frame boundary parameters 
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and are preferably received by way of a direct memory 
access (DMA) operation by host processor 12, or alter- 
natively by way of buffered serial ports. Data register 
and logic 22 also includes memory space for storing oth- 
er parameters involved in the coprocessor operation, 
particularly in initialization of sinusoid coprocessor 15 
for the initial sample of a frame; such other parameters 
include nominal frequencies co£ for each of the m sinu- 
soidal components, and to the frame boundary param- 
eters 



{a; 



^ n-2 r 

,a m ,a 



n-1 



of each component m generated in the previously proc- 
essed frame. Similarly, address decoder and control cir- 
cuitry 24 of sinusoid coprocessor 15 receives instruction 
codes and control signals from host processor 12, ac- 
cording to which sinusoid processor 15 operates to gen- 
erate the synthesized sum-of-sinusoid signals from the 
operands received by data register and logic 22. Addi- 
tionally, address decoder and control circuitry 24 con- 
trols other resources within sinusoid coprocessor 15 to 
accomplish initialization of registers and the like. 
[0036] Parameter registers 26 receive parameters 
from host processor 12 via data register and logic 22, 
underthe control of address decoder and control circuit- 
ry 24. According to the foregoing discussion of the the- 
ory under which sinusoid coprocessor 15 operates, and 
as will become apparent from the following description, 
parameter registers 26 store the various parameters to 
be used in the recursive generation of the synthesized 
signal for the next (k th ) sample period within the current 
(n th ) frame. In this regard, parameter register A stores 
the current instantaneous amplitude value A£(nL+k-1) 
for sample k-1 within frame n, parameter register AA 
stores the current value of AA^ for frame n, parameter 
register AX stores the current difference value Ax£ for 
frame n, parameter register X stores the current value 
of x£(k-1) for the k-1 th sample in the n th frame, and pa- 
rameter register 8 stores the current instantaneous 
phase value 6£(nL+k-1) for sample k-1 of frame n. All 
of these parameters are specific to the m th component 
sinusoid. 

[0037] Adder 28 performs the addition of the contents 
of parameter register A and the contents of parameter 
register AA. In this example, parameter register A 
presents a twenty-bit current instantaneous amplitude 
value A£(nL+k-1), and parameter register A presents 
nine-bit difference value AA" , with the nine bits corre- 
sponding to the nine least significant bits of the twenty- 
bit current instantaneous amplitude value A^(nL+k-1) 
stored in parameter register A. Adder 28 produces a 
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11 

twenty-bit value corresponding to the sum of these pa- 
rameters, namely instantaneous amplitude A£(nL+k) 
for the k th sample in frame n, and returns this twenty-bit 
sum to parameter register A for storage and use in the 
next iteration (for sample k+1 ). The sixteen most signif- 
icant bits of this sum are forwarded to sine function cal- 
culator 34, on lines A n (k) as shown in Figure 3. 
[0038] Adder 30 similarly receives and adds the con- 
tents of parameter registers AX and X. In this example, 
parameter register X stores a twenty-four bit intermedi- 
ate difference value x£ (k-1 ), and parameter register AX 
stores a sixteen bit difference value Ax£ (corresponding 
to the sixteen least significant bits of the contents of pa- 
rameter register X) . Adder 30 produces a twenty-four 
bit sum of these parameters, the sum corresponding to 
the next intermediate difference value x£(k), and 
presents this result to parameter register X for storage 
and use for the next sample k+1 , and to one input of 
adder 32. 

[0039] Adder 32 also receives 3 as a twenty-four bit 
value in this example, the contents of parameter register 
9, which contains current instantaneous phase value 6£ 
(nl_+k-1 ) for sample k-1 . The sum produced by adder 32 
corresponds to the instantaneous phase value 6£ 
(nL+k) for sample k, and is returned to parameter reg- 
ister e for storage and use in the next sample period k+1 . 
The sixteen most significant bits of this value are for- 
warded to sine function calculator 34, on lines en (k). 
[0040] As evident from the foregoing description, sine 
function calculator receives the instantaneous ampli- 
tude and phase values A£ (nL+k), 0£ (nL+k), respective- 
ly, for sample k. As noted above, the actual k th value of 
the m th sinusoidal component at sample point k in frame 
n is given by: 

(nL + k) = (nL + k) • sin{9» (nL + k)} 

or, in other words, by the product of the instantaneous 
amplitude A£(nL+k) and the sine of the instantaneous 
phase 6£(nL+k). In this preferred embodiment of the 
present invention, sine function calculator 34 calculates 
this product. 

[0041] Sine function calculator 34 may be implement- 
ed by way of any one of a number of known circuit and 
algorithmic functions for generating the sine value of a 
digitally presented angle, and for applying an amplitude 
thereto. Examples of useful conventional techniques for 
performing this calculation include Pade approximation, 
and table lookup approaches. Between these two tech- 
niques, table lookup is more likely to be useful in con- 
nection with an integrated circuit such as sinusoid co- 
processor 15, in that a relatively simple ROM table and 
multiplier may be readily implemented to provide the de- 
sired product. 

[0042] According to the preferred embodiment of the 
present invention, however, sine function calculator 34 



12 

is implemented by way of a the Coordinate Rotation Dig- 
ital Computer (CORDIC) technique described in Voider, 
"The CORDIC Trigonometric Computing Technique", 
IRE Transactions on Electronic Computers, Vol. EC-8 

5 (September, 1959), pp. 330-334. 

[0043] As described in the Voider article, trigonomet- 
ric functions of a given angle, at a given magnitude, may 
be iteratively derived by the pseudo-rotation of the cor- 
responding vector to a zero angle. The pseudo-rotation 

'0 is carried out by modifying the x and y vector compo- 
nents by increments derived from an angle of rotation 
that has a decreasing magnitude with each iteration, as 
will now be described. For the case of an initial vector 
having a magnitude A and an angle 6, the CORDIC tech- 

'5 nique may begin by initializing an x component X 1 to the 
magnitude A 1 , an angle value e 1 to the value e, and a y 
component Y 1 to zero. According to the CORDIC tech- 
nique, the second values X 2 , Y 2 of the x and y compo- 
nents, respectively, are determined by: 

x 2 = +Y t 



Y 2 = ±X 1 

which amounts to a ±90° rotation of the initial vector. 
The sign of the rotation depends upon a comparison be- 
tween the initial phase angle 8 and 90°; if the initial 
30 phase angle 6 exceeds 90°, the rotation is positive (such 
that X 2 =-Y 1 and Y 2 =X 1 ) > and vice versa. The next phase 
angle value 0 2 is derived as the difference 0-90°. As de- 
scribed in the Voider article, subsequent iterations for 
X i+1 and Y, +1 are carried out by evaluation of: 

35 

X^-X^^Y, 

40 Y i+1 = Y i + £i 2 ^ >X i 



- 1, for 0 i <a. i 
+ 1, for Q { > ct i 



a, =tan" 1 2™ 

The set of angles otj may be precalculated, as these it- 
55 eration-dependent angles are not data-dependent. 
[0044] The iterative generation of the values of X i+1 , 
Y j+1 is well-suited for execution by digital logic. As evi- 
dent from the above equations, this operation involves 
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the cross -addition of the current X and Y values X h Y h 
where one of the terms is divided by a power of two that 
depends upon the current iteration. Of course, division 
by a power of two may be readily performed simply by 
shifting a digital value. 

[0045] The CORDIC trigonometric determination is 
most easily carried out for a specified number of itera- 
tions, with the total number K of iterations being defined 
according to a worst case error. In the determination of 
the sine of an angle, the result may simply be found as 
the terminal Y value Y K at the end of the K iterations. It 
should be noted that this value should be normalized, 
either prior to the initializing of the X 1 , Y-, values or after 
the K iterations, in order to account for the increase in 
the magnitude of the vectors that results in each cross- 
addition. This magnitude increase depends only upon 
the number of iterations, and not upon the value of the 
operands, and as such the normalization factor may be 
preset in the calculation. 

[0046] Referring now to Figure 4, the construction of 
sine function calculator 34 for calculating the signal s£ 
(k) for the m^ component sinusoid by following the 
CORDIC technique, and according to this preferred em- 
bodiment of the invention, will now be described. Of 
course, other realizations of the CORDIC technique 
may alternatively be used in the implementation of sine 
function calculator 34 in sinusoid coprocessor 1 5. An ex- 
ample of one such alternative realization is described in 
Wang, et al., "Granularly-Pipelined CORDIC Proces- 
sors for Sine and Cosine Generators", Proc. of the In- 
ternational Conference on Acoustics, Speech and Sig- 
nal Processing (IEEE, 1 996). However, it is contemplat- 
ed that the arrangement of Figure 4 is particularly effi- 
cient for implementation into a microprocessor- 1 ike ar- 
chitecture. 

[0047] As shown in Figure 4, lines A n (k) from adder 
28 are received by normalizer48 in sine function calcu- 
lator 34; normalizer 48 adjusts the digital value corre- 
sponding to the current instantaneous amplitude A£ 
(nl_+k) to compensate forthe increase in magnitude pro- 
duced by a specified number of iterations in the 
CORDIC routine, as noted above. Alternatively, the out- 
put of sine function calculator 34 may be so normalized, 
if desired. The output of normalizer 48 is applied to X 
register 50x. Lines 6 n (k) from adder 32 are received by 
phase register 509 of sine function calculator 34, such 
that the instantaneous phase value 6£(nL+k) is stored 
therein. 

[0048] Outputs of X register 50x are applied to right 
shift register 52x, and to one input of adder 62. Similarly, 

Y register 50y (which is initialized to zero) has an output 
applied to right shift register 52y and to one input of 
adder 60. Right shift registers 52x, 52y are each con- 
trolled by the output of index counter 54, to apply a right 
shift by a number of bits corresponding to the iteration 
index i, as described above. The outputs of right shift 
registers 52x, 52y are applied to second inputs of X and 

Y adders 62, 60, respectively (via exclusive-OR function 



58 and exclusive-NOR function 59, as will be described 
below), to effect the CORDIC cross-addition operation. 
Adders 60, 62 also receive a control input from index 
counter 54, which causes adders 60, 62 to simply select 

5 the gated output of right shift registers 52x, 52y in the 
first iteration (with no addition performed), thus applying 
the initial ±90° rotation. The output of X adder 62 is for- 
warded back to X register 50x, to store the new value 
X j+1 for use in the next iteration. The output of Y adder 

10 60, which presents the value Y k1 , is similarly forwarded 
back to Y register 50y for use in the next iteration. Ad- 
ditionally, because sine function calculator 34 is for cal- 
culating a sine function, the output of Y adder 60 is also 
forwarded to adder 36 (Figure 3) in sinusoid coproces- 

15 sor 1 5, to communicate thereto the final result of the sine 
function calculation upon completion of the desired 
number K of iterations. 

[0049] As noted above, the cross-addition operation 
performed by X and Y adders 62, 60, respectively, ap- 
20 piles a pseudo-rotation to the current x and y compo- 
nents Xj, Yj, through an angle corresponding to the cur- 
rent iteration. As also described above, the direction of 
rotation (i.e., the sign of the rotation angle) depends up- 
on the sign of a comparison of the current contents of 
25 phase register 500 with an angle ccj that depends upon 
the iteration index. In this regard, the output of phase 
register 506 is applied to a positive input of adder 57, 
and the output of ATR table 56 is applied to a negative 
input of adder 57. ATR table 56 is a look-up table storing 
30 a sequence of arctangent values that are sequentially 
output from iteration-to-iteration, in response to the cur- 
rent value of index counter 54, thus providing the values 
of angles otj to adder 57. As noted above, these angles 
ctj may be derived as: 

35 

a, = tan- 1 2'™ 

and thus follows a sequence of 90°, 45°, 26.5°,.... These 
40 angles otj are subtracted from the current contents of 
phase register 506 by adder 57, with the resulting differ- 
ence (i.e., angle 6 j+1 ) being stored back into phase reg- 
ister 506 for use in the next iteration. The sign bit of the 
output of adder 57 is forwarded, on line SGN, to an input 
45 of each of exclusive-OR function 58 and exclusive-NOR 
function 59, to effectively pass or invert the shifted out- 
put of right shift registers 52x, 52y, respectively, in re- 
sponse to the angle comparison. As such, exclusive-OR 
function 58 and exclusive-NOR function 59 implement 
50 the £ function described above relative to the CORDIC 
technique. 

[0050] In operation, sine function calculator 34 re- 
ceives the instantaneous amplitude value A£(nL+k) 
from adder 28 (Figure 3) on lines A n (k), and the instan- 
55 taneous phase value 6£(nL+k) from adder 32 on lines 
6 n (k). At this point, index counter 54 is reset (e.g., to the 
value "1 "), and the contents of Y register 50y is reset to 
zero. Normalizer 48 adjusts the instantaneous ampli- 



25 



30 



8 



BNSDOCID: <EP 1 099998 A2_l_> 



15 

tude value A£(nL+k) to account for the increase in am- 
plitude caused by the sine calculation for K iterations, 
and stores the adjusted amplitude value in X register 
50x. The instantaneous phase value 0£(nL+k) on lines 
G n (k) is stored in phase register 50G. 
[0051] For the first iteration, in which a pseudo-rota- 
tion of ±90° is applied, adder 57 compares the value of 
90° (stored in the first location within ATR table 56) with 
the current value 9 1 stored in phase register 509, which 
is instantaneous phase value 8£(nL+k) from adder 32. 
The sign of the result of this subtraction is applied to 
inputs of exclusive-OR function 58 and to exclusive- 
NOR function 59, via line SGN. Meanwhile, the initial 
contents of X register 50x and Y register 50y are stored 
in right shift registers 52x, 52y, respectively. As this is 
the first iteration, noshift is applied by right shift registers 
52x, 52y, and their outputs are applied to exclusive-OR 
function 58 and to exclusive-NOR function 59. One of 
exclusive-OR function 58 and exclusive-NOR function 
59 will invert its input (which, in the case of exclusive- 
NOR function 59, is zero from Y register 50y), and the 
outputs of both will be applied to adders 60, 62, respec- 
tively. Because this is the first iteration, as indicated by 
index counter 54, adders 60, 62 simply forward the out- 
puts of exclusive-OR function 58 and exclusive-NOR 
function 59, respectively, to Y register 50y and X register 
50x. No valid output is generated by Y adder 60 at this 
point in the process. 

[0052] Sine function calculator 34 then continues its 
iterative processing with iteration i=2, up to reaching a 
preselected number K of iterations. In each such itera- 
tion, the values x v yi are forwarded from their respective 
X and Y registers 50 x , 50 y to right shift registers 52x, 
52y, respectively. Right shift registers 52x, 52y output 
these values, shifted by a number of bits indicated by 
the current value of the iteration index i (i.e., shifted right 
by i-2 bits), and applies the shifted results to exclusive- 
OR function 58 and exclusive-NOR function 59, respec- 
tively. Meanwhile, the current phase angle 8j has a cur- 
rent angle otj subtracted therefrom by adder 57 to gen- 
erate next phase angle 0 i+1 , which is stored back into 
phase register 500; the sign of this next phase angle 6 j+1 
is applied to exclusive-OR function 58 and exclusive- 
NOR function 59 to invert the appropriate one of the out- 
puts of right shift registers 52x, 52y. Adders 60, 62 then 
add these values to the current values Y h Xj, respective- 
ly, to produce new values y^ , x i+1 ; these sums are then 
stored back into their respective Y register 50y and X 
register 50x. 

[0053] This iterative process is repeated for a number 
K of iterations, which is typically preselected according 
to a worst case design. Upon completion of the K lh iter- 
ation, the final sine result, which corresponds to the val- 
ue Y K , is output from adder 60 of sine function calculator 
34. This signal is applied to one input of adder 36, as 
shown in Figure 3, to which reference is now again di- 
rected for description of the remainder of the construc- 
tion of sinusoid coprocessor 15. 



16 

[0054] Adder 36 of sinusoid coprocessor 1 5 serves to 
accumulate the signal components for each sample 
point k within frame n, over all M sinusoids. In this re- 
gard, the second input of adder 36 receives a value from 
5 data sample buffer 40, via AND function 39. As shown 
in Figure 3 , the output of data sample buffer 40 is applied 
to one input of multi-bit AND function 39, while a control 
signal generated on line CS by channel separation con- 
trol 38 is applied as a second input thereto, thus gating 
io the application of the output of data sample buffer 40 to 
adder 36. In this manner, channel separation control 38 
controls whether the current output of adder 36 accu- 
mulates with (line CS high), or instead initializes (line 
CS low), the contents of data sample buffer 40. The out- 
's put of adder 36 is temporarily stored in register 42, the 
output of which is applied to an input of data sample buff- 
er 40 and also to output buffer 44. Output buffer 44 
presents the result of the accumulated sample values 
to post-processing circuitry 17 (or, alternatively, to host 

20 processor 12), underthe control of asignal from channel 
separation control 38 on line OE. 
[0055] As noted above, data sample buffer 40 is used 
in the accumulation of signals over the M sinusoids for 
each sample k within frame n, for a given output signal 

25 channel. Figure 5 illustrates the architecture of data 
sample buffer 40 according to the preferred embodi- 
ment of the invention. Data sample buffer 40 includes L 
entries 64, one for each of the L sample periods within 
a data frame. In this example, each ordered entry 64 0 

30 through 64 L . 1 is a sixteen-bit memory location, corre- 
sponding to the sixteen-bit output of (and input to, in this 
case) adder 36. Address decoder 66 receives control 
signals from address decoder and control circuitry 24, 
so as to point to entries 64 in a sequential manner cor- 

35 responding to the operation of sinusoid coprocessor 15. 
In this regard, the operation of data sample buffer 40 will 
point to the entry 64 corresponding to the currently proc- 
essed sample point within the data frame; for example, 
if sinusoid coprocessor 15 is generating signal s£ 

40 (nL+k), address decoder 66 in data sample buffer 40 will 
direct output and input operations to entry 64 k . Data 
sample buffer 40 also includes output circuitry 67 for 
communicating the contents of the entry 64 selected by 
address decoder 66 to adder 36 via AND function 39, 

45 and input circuitry 69 for writing the output from adder 
36 (via register 44) to the entry 64 selected by address 
decoder 66. 

[0056] Referring back to Figure 3, the output of regis- 
ter 42 is also applied to output buffer 44. Output buffer 

so 44 interfaces with post-processing circuitry 1 7 (or, alter- 
natively, with host processor 12), so as to return the re- 
sult of the sinusoidal coprocessing instruction. In this re- 
gard, output buffer 44 returns the final results of the sig- 
nal s n (nl_+k) for all sample points k within the n th frame, 

55 for a given channel. As such, output buffer 44 is control- 
led by an output enable signal on line OE driven by chan- 
nel separation control 38, to enable output buffer 44 to 
present the accumulated output from adder 36 for each 
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sample k within the frame for the M th (i.e., the last) si- 
nusoidal component. This accumulated result will in- 
clude the sum of all M sinusoidal components for each 
of the k samples, with the components m=1 M-1 hav- 
ing been saved within data sample buffer 40, and added 5 
to the final component values of component m=M by 
adder 36. 

[0057] Referring now to Figure 6, the operation of si- 
nusoid coprocessor 15 according to the preferred em- 
bodiment of the present invention will now be described 10 
in detail. As will be apparent from the following descrip- 
tion, the operation illustrated in Figure 6 is applicable to 
multi-channel operations, where separate and inde- 
pendent signals s n (t) are generated for multiple chan- 
nels in each frame. Additionally, the operation illustrated is 
in Figure 6 is described hereinbelow in connection with 
a single frame, it being understood that the process may 
be repeated for such number of frames as to be com- 
municated by system 20 within which sinusoid coproc- 
essor 15 is embodied. 20 
[0058] According to the preferred embodiment of the 
invention, the operation of sinusoid coprocessor 15 is 
initiated by host processor 12 issuing a coprocessor in- 
struction to sinusoid coprocessor 15, in response to 
which sinusoid coprocessor 1 5 begins the generation of 25 
signal components for a frame. Sinusoid coprocessor 
15 begins this generation with the initialization of com- 
ponent index m to 1 , and of sample point index k to 0. 
In process 72, channel separation control 38 disables 
output buffer 44 by driving line OE low. Since, at this 30 
point in the process, the first component (m=1) is to be 
generated for a frame, any previous signal results in da- 
ta sample buffer 40 belong to a previous frame, and 
should not be included in the accumulation of the next 
signal components generated by sinusoid coprocessor 35 
15. Accordingly, channel separation control 38 disables 
accumulation of results into data sample buffer 40 by 
driving line CS low in process 74. 
[0059] Following the initialization of processes 70, 72, 
74, sinusoid coprocessor 15 next generates the signal 40 
contribution for the first (m=1) sinusoidal component at 
the first sample point (k=0) within the current frame, in 
process 76. Referring now to Figure 7, the detailed op- 
eration of sinusoid coprocessor 1 5 in generating the sig- 
nal component in process 76 will now be described in 45 
further detail. 

[0060] Process 76 begins with sinusoid coprocessor 
determining, in decision 89, whether the current iteration 
corresponds to a first sample period (k=0) in the frame. 
As will become further apparent from the description so 
hereinbelow, if process 76 is generating a signal value 
for a later sample point (k>0) in the frame, parameter 
registers 26 will already contain the appropriate values 
for the process. However, for the first sample point 
(k=0), parameter registers 26 will require initialization 55 
with values corresponding to the particular sinusoidal 
component m, as the previously stored values will per- 
tain to a different component. Decision 89 thus deter- 



mines whether initialization of parameter registers 26 is 
necessary by testing the value of sample point index k. 
If k is non-zero (decision 89 is NO), control passes to 
process 94, skipping initialization of parameter registers 
26. In this case, however, since sample point index k is 
zero, decision 89 is YES and initialization continues with 
process 90. 

[0061] In process 90, sinusoid coprocessor 15 re- 
ceives frame boundary parameters from host processor 
12, specifically for the current sinusoidal component m. 
These parameters include, as noted above, amplitude 
and phase coefficient parameters 

{A^cC^cC 1 } and {A»*\cC}. 

These parameters are preferably forwarded to sinusoid 
coprocessor 15 either by way of direct memory access 
(DMA) or buffered serial ports (BSPs) of host processor 
1 2, so as not to occupy the processing capacity of host 
processor 12. Sinusoid coprocessor 1 5 stores these re- 
ceived frame boundary parameters within data register 
and logic 22, for use in the generation of the output sig- 
nal. 

[0062] The particular timing with which the frame 
boundary parameters are forwarded by host processor 
1 2 and received by sinusoid coprocessor 1 5 may be se- 
lected for purposes of efficiency. As shown in Figure 7, 
the communication of frame boundary parameters for 
component m in the current frame may occur directly 
within the coprocessor operation for that component m 
during the frame. Of course, if these frame boundary pa- 
rameters 

{A^cC 2 ^:- 1 } and {A-\«lU 

are available prior to the generation of the signal com- 
ponent for component m in frame n, host processor 12 
may forward such parameters to sinusoid coprocessor 
1 5 for storage therein, assuming that sufficient memory 
capacity is available within data register and logic 22 of 
sinusoid coprocessor 1 5. Such advance communication 
is particularly contemplated for use in connection with 
DMA communication between host processor 12 and 
coprocessor 15. In this regard, it is preferable that data 
register and logic 22 be implemented by way of dual- 
port memory, thus permitting asynchronous receipt of 
frame boundary parameters during the sinusoid synthe- 
sis operation. The provision of process 90 within gener- 
ation process 76 as shown in Figure 7 primarily refers 
to the time by which these parameters must be received 
by sinusoid coprocessor 15 in order to avoid a stall sit- 
uation. 

[0063] Further, one should note that not all of frame 
boundary parameters 
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{A^ct^cC 1 } and {Ar,<} 



need be forwarded for each component in each frame. 
Rather, it will be noted by those skilled in the art having 
reference to this specification that, for a given compo- 
nent m in frame n, the parameters 



the frame boundary parameters by: 



will have been previously used for component m in the 
previous frame n-1 . As such, only the two frame bound- 
ary parameters 



need be communicated for each sinusoidal component 
m in each frame n (outside of the first frame, which will 
require all five frame boundary parameters). One may 
therefore derive the overall data transfer requirements 
from host processor 12 and coprocessor 1 5 as two data 
words per component m per frame per channel, or 2M 
data words per frame. The bandwidth required (in this 
direction) will thus be the product of 2M times the frame 
period, times the number of channels handled by sinu- 
soid coprocessor 15. 

[0064] Once frame boundary parameters 
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A n+1 - A n 



Preferably, the number of samples L within a data frame 
is a power of two, such that this division may be carried 
out by a simple right shift. Instantaneous phase param- 
eter register 9 is initialized with a calculated phase value 
8£(nL) that is derived from the known parameters of 
nominal frequency co m , frame number n, and frame 
length (in number of samples) L, as well as from certain 
of the frame boundary parameters as follows: 



n-1 n-2 



Q n m (nL) = <» m nL + 



Intermediate difference parameter register AX is loaded, 
in process 92, with an initial value that is derived from 
two operations: 



n-1 n-2 
n + 1 n ^ <*m " 
<°m = <*>m + ; 



and then: 



Ax" = 



{A^a^a- 1 } and {a»*\<C} 



have been received in process 90, sinusoid coprocessor 
15 next performs process 92 to initialize parameter reg- 
isters 26 for use in the generation of the signal. This in- 
itialization is performed for each component m in each 
frame, as evident from the location of process 92 in gen- 
eration process 76. Initialization process 92, as will be- 
come apparent from the following description, requires 
the addition, multiplication, and division (preferably di- 
vision by a power of two so as to be carried out by way 
of shifts) of certain parameters. It is contemplated that 
these operations may be carried out by way of adders 
28, 30, 32, and such other circuitry (not shown in Figure 
3), as controlled by address decoder and control circuit- 
ry 24. It is therefore contemplated that those skilled in 
the art will be readily able to effect initialization process 
92 in the most appropriate manner for the particular re- 
alization of sinusoid coprocessor 15. 
[0065] In this regard, parameter register A is initial- 
ized with the frame boundary parameter A£ . Parameter 
register AA is initialized with a delta amplitude value 
AA£ for component m and current frame n, derived from 



The initialized contents of parameter register X is de- 
rived from the initial intermediate difference value pa- 
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55 



rameter Ax " as follows: 

m 



O0) = <o" 



Parameter registers 26 are thus initialized in preparation 
for the generation of sinusoidal component signals for 
the initial sample point k=0 in the current frame, 
[0066] in process 94 or process 94', as the case may 
be, sinusoid coprocessor 15 generates the next values 
of intermediate instantaneous amplitude A£(nL+k) and 
instantaneous phase 6£(nl_+k), for sample point k. For 
the special case of the first sample period (k=0), process 
94' forwards the initialized contents of parameter regis- 
ters A and 6 to sine function calculator 34, considering 
that the contents of these parameter registers already 
correspond to the instantaneous amplitude value A£ 
and instantaneous phase value 6 n (nL) for the initial 
sample point k=0 at the boundary of the current frame 
for the current sinusoidal component m. Adders 28, 30, 
32 in sinusoid coprocessor 15 are effectively bypassed 
in this case. 
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[0067] In the general case where the current sample 
point k is not at the initial frame boundary (i.e., for k>0), 
process 94 derives the next values of i nstantaneous am- 
plitude A£(nL+k) and instantaneous phase 0£(nL+k) 
from the current contents of parameter registers 26. The 
instantaneous amplitude A£(nL+k) is produced by 
adder 28 by the addition of the contents of parameter 
register A with the contents of parameter register AA; 
the twenty-bit output of adder 28 is then forwarded back 
to, and stored in, parameter register A. Adder 28 thus 
produces instantaneous amplitude A£(nL+k), consider- 
ing that the contents of parameter register A applied to 
adder 28 correspond to the instantaneous amplitude A£ 
(nL+k-1) and the contents of parameter register AA 
which corresponds to the delta amplitude value AA£, 
such that the new instantaneous amplitude value A£ 
(nl_+k) is calculated according to the above-noted rela- 
tionship: 

A^(nL+k) = A>L + k-1) + AA^ 

for component m. In this preferred embodiment of the 
invention, the sixteen most significant bits of this result 
is forwarded to sine function calculator 34 on lines A n 
(k), as shown in Figure 3. The higher precision (twenty- 
bit) result is used in successive additions, to avoid the 
accumulation of round-off error. 

[0068] Process 94, for sample points k>0 within the 
frame, also includes the operation of adder 30 that adds 
the contents of parameter register AX to the contents of 
parameter register X; this operation of adder 30 corre- 
sponds to the addition: 

considering that parameter register AX contains the in- 
termediate difference value Ax£ and parameter register 
X contains the value x£(k-1) from the previous sample 
point k-1 . The output of adder 30 is then returned to pa- 
rameter register X for use in connection with the next 
sample point k+1 , and also to one input of adder 32. The 
second input of adder 32 receives the current contents 
of parameter register 0, which contains the instantane- 
ous phase value G£ (nL+k-1) derived in connection with 
the previous sample point k-1 ; adder 32 thus performs 
the addition: 

e>L + k) = e>L + k-i) + x>) 

The output of adder 32, which corresponds to the cur- 
rent instantaneous phase 6£(nL+k) for sample point k, 
is stored back into parameter register 6 for use in con- 
nection with the next sample point k+1 . The sixteen 
most significant bits of the output of adder 32 are also 
forwarded to sine function calculator 34 on lines 6 n (k), 



as shown in Figure 3. 

[0069] In either case, whether by way of process 94 
or, in the case of sample point k=0 in the current frame 
by way of process 94', sinusoid coprocessor 1 5 next ex- 
5 ecutes process 96 by way of which sine function calcu- 
lator 34 determines the value of the signal component 
s£(nL+k), as follows: 

io s^(nL + k) = A n m (nL + k) • sin^ (nL + k)} 

The operation of sine function calculator 34 in perform- 
ing process 96 is described in detail hereinabove, rela- 
ys tive to Figure 4. As a result of this process, sine function 
calculator 34 presents a digital value corresponding to 
the signal component s£ (nL+k) for sample point k within 
frame n for sinusoidal component m. 
[0070] Following process 96, sinusoid coprocessor 
20 15 next adds signal component s£(nl_+k) from the out- 
put of sine function calculator 34 to previous signal com- 
ponents for previously generated sinusoidal compo- 
nents for sample point k in frame n (if any), in process 
98. The addition of process 98 is performed by adder 
25 36, which receives the output of sine function calculator 
34 at one input and the output of AND function 39 at the 
other input. I n the present case where the first sinusoidal 
component for sample point k is generated in process 
96 (i.e., where m=1), HneCS is driven low in process 74 
30 and remains low; this prevents any contents of data 
sample buffer40 from being presented to adder 36, and 
as such adder 36 simply forwards the signal component 
s!J (nL+k) to register 42, and from register 42 to the entry 
64 k of data sample buffer 40 corresponding to sample 
35 point k. As will be noted below, once line CS is driven 
high : the contents of data sample buffer entry 64 k cor- 
responding to the current sample point k are forwarded 
to adder 36; in process 98, adder 36 then adds the new 
signal component s£(nL+k) to the previously accumu- 
40 lated signal components s n (nL+k) for that sample point 
k in frame n, in which case the accumulated sum: 

f>r(nL + k) 

1 = 1 

is then stored in entry 64 k of data sample buffer 40. 
[0071] Referring back to Figure 6, upon completion of 

so process 76 in which the sinusoidal component s£ (nL+k) 
is generated, sinusoid coprocessor 15 next performs 
decision 77 to determine if the last sample point L-1 in 
the current frame is the sample point which was most 
recently processed. If not (decision 77 is NO), the sam- 

55 pie point index k is incremented in process 78, and chan- 
nel separation control 38 enables accumulation by 
adder 36 and data sample buffer 40 (if not already so 
enabled), by driving line CS to a high level, which will 



12 



BNSDOCID: <EP 1 099998A2J_> 



23 EP 1 099 998 A2 24 



cause AND function 39 to forward the output of data 
sample buffer 40 to adder 36 for succeeding iterations. 
Control then passes back to process 76, for generation 
of the next signal component, at the next sample point 
within frame n, for the current sinusoidal component m. 
[0072] Upon decision 77 determining that all sample 
points within frame n have been processed for a given 
sinusoidal component (decision 77 is YES), sinusoid co- 
processor 15 next executes decision 79 to determine if 
the last sinusoidal component has been processed, by 
comparing the current component index m to the 
number M of components in the signal. If additional com- 
ponents remain to be processed (decision 79 is NO), 
sinusoid coprocessor 15 next evaluates decision 81 to 
determine whether the next sinusoidal component is the 
last one, by comparing the current value of component 
index m to the value M-1. If the next component is not 
the last component of the signal (decision 81 is NO), 
sinusoid coprocessor 15 increments component index 
m in process 84, resets sample point index k to zero in 
process 88, and returns control to process 76 for the 
generation of the first sample point value for the next 
sinusoidal component. The operations of process 76 
and decisions 77, 79 then continues to process this next 
component. 

[0073] If the next component is indeed the last com- 
ponent (decision 81 is YES), however, channel separa- 
tion control 38 then enables output buffer 44 in process 
82, by driving line OE to an active level. Control then 
passes to process 82 for the incrementing of the com- 
ponent index (to the value M) and to process 88 for the 
resetting of sample point index k. With output buffer 44 
enabled by process 82 in this instance, the output pro- 
duced from process 76 at adder 36, via register 42, will 
be driven by output buffer 44 for receipt by host proces- 
sor 12, for each of the L sample points for the current 
frame. Upon completion of all L sample points in the 
frame (decision 77 is YES), decision 79 will also return 
a YES result because, at this point, component index m 
will equal the terminal value M. Sinusoid coprocessor 
15 will then perform decision 83 to determine whether 
additional channels (i.e., independent signals) remain 
to be generated in the current frame. If so (decision 83 
is YES), control passes back to process 70 for initializa- 
tion of sinusoid coprocessor 15 for this next channel's 
signal. If not (decision 83 is NO), control passes to proc- 
ess 88 in which host processor 12 and sinusoid coproc- 
essor 15 carry out the appropriate actions, by way of 
issuing and executing instructions, for example, to ad- 
vance to the next frame following which the process of 
Figure 6 may be repeated, if desired. 
[0074] The architecture, construction, operation, and 
use of a sinusoid coprocessor according to the present 
invention provides numerous advantages in the synthe- 
sis of signals. Firstly, the arithmetic operations per- 
formed by the coprocessor in signal synthesis are rela- 
tively simple and may be carried out primarily by way of 
adds and shifts. As such, it is contemplated that the 



computational rate achievable by relatively modest cir- 
cuitry may be quite high. For example, through simula- 
tion, it is contemplated that a sinusoid coprocessor ac- 
cording to the preferred embodiment of the invention 
5 may be realized in approximately 9,500 logic gates with 
2k bits of dual-port memory, and that, using modern 
0.25|i CMOS technology for transistor construction, 
such a coprocessor can achieve on the order of 95 mil- 
lion sine computations per second. At this performance 

10 level, it has been estimated that a signal composed of 
over 2000 sinusoidal components may be synthesized, 
in real-time, at a sampling rate of 44.1 kHz as suitable 
for music synthesis. It is therefore contemplated that the 
rate at which signals may be synthesized by a sinusoid 

15 coprocessor according to the present invention will be 
limited by the data transfer bandwidth over which an as- 
sociated host processor may communicate the frame 
boundary parameters, rather than by the internal com- 
putational rate of the coprocessor. 

20 [0075] Furthermore, the present invention facilitates 
the generation of multi-channel signal synthesis, by pro- 
viding control for the separation of channels in the ac- 
cumulation of signal components and the control of the 
output of the signal values. 

25 [0076] Still further, it is contemplated that the relative- 
ly modest circuit requirements for a sinusoid coproces- 
sor according to the present invention will enable the 
implementation of such sinusoid coprocessors in a wide 
range of electronic systems, including low-cost systems 

30 suitable for introduction into the consumer market. 

[0077] While the present invention has been de- 
scribed according to its preferred embodiments, it is of 
course contemplated that modifications of, and alterna- 
tives to, these embodiments, such modifications and al- 

35 ternatives obtaining the advantages and benefits of this 
invention, will be apparent to those of ordinary skill in 
the art having reference to this specification and its 
drawings. It is contemplated that such modifications and 
alternatives are within the scope of this invention as sub- 

^0 sequently claimed herein. 



Claims 

45 1. A coprocessor for synthesis of a digital signal cor- 
responding to a sum of sinusoids, comprising: 

a plurality of parameter registers for storing pa- 
rameters corresponding to a previous sample 

50 point in a frame, the plurality of parameter reg- 

isters comprising an amplitude register, a delta 
amplitude register, a phase register, a differ- 
ence register, and a delta difference register; 
a first adder for adding the contents of the am- 

55 plitude register and the delta amplitude register 

to produce a current amplitude value for a cur- 
rent sample point in the frame, the first adder 
having an output coupled to the amplitude reg- 
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ister to update the contents thereof; 
a second adder for adding the contents of the 
difference register and the delta difference reg- 
ister, the second adder having an output cou- 
pled to the difference register to update the con- 
tents thereof; 

a third adder having inputs coupled to the out- 
put of the second adder and to the phase reg- 
ister, for adding at least a portion of the output 



26 

of the accessed entry of the data sample buffer 
to produce an accumulated result; 

wherein the data sample buffer writes the ac- 
5 cumulated result at its input to the accessed entry 

of the data sample buffer. 

6. The coprocessor of claim 5, further comprising: 

a gate, having a data input coupled to the output 
of the data sample buffer, having a control in- 
put, and having an output coupled to the sec- 
ond input of the accumulator adder; and 
channel separation control, having a first output 
coupled to the control input of the gate, for 
blocking the output of the data sample buffer 
from the second input of the accumulator adder 
for a first sinusoid of the sum of sinusoids. 

The coprocessor of claim 6, wherein the channel 
separation control has a second output coupled to 
a control input of the output buffer, for enabling the 
output buffer to present accumulated digital values 
to the host processor for a last sinusoid of the sum 
of sinusoids. 

The coprocessor of any preceding claim, wherein 
the sine function calculator circuitry comprises: 

a phase register, having an input coupled to the 
output of the third adder for receiving the cur- 
rent phase value; 

an X register, having an input coupled to the 
output of the first adder for receiving the current 
amplitude value; 
a Y register; 

a first right shift register, coupled to the output 
of the X register, for shifting its contents by a 
selected number of bits; 
a second right shift register, coupled to the out- 
put of the Y register, for shifting its contents by 
the selected number of bits; 
first and second inverting circuits, each having 
a control input, for selectively inverting the out- 
puts of the first and second right shift registers, 
respectively, responsive to a signal at the con- 
trol input; 

a Y adder, having a first input coupled to the Y 
register and a second input coupled to an out- 
put of the first inverting circuit, and having an 
output coupled to an input of the Y register and 
to the accumulator circuitry; 
an X adder, having a first input coupled to the 
X register and a second input coupled to an out- 
put of the second inverting circuit, and having 
an output coupled to an input of the X register; 
and 

a phase comparator, having a first input cou- 



of the second adder to the contents of the 10 
phase register to produce a current phase val- 
ue for the current sample point in the frame, the 
third adder having an output coupled to the 
phase register to update the contents thereof; 
sine function calculator circuitry, having inputs is 
coupled to the output of the first adder and to 
the output of the third adder, for generating a 
digital value corresponding to a sine of the cur- 
rent phase value at the current amplitude value; 
and 20 
accumulator circuitry, for adding the digital val- 
ue at the output of the sine function calculator 
circuitry to previously calculated digital values 
for the current sample point in the frame. 
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The coprocessor of claim 1 , further comprising: 

data register and logic circuitry, for receiving 
parameters from a host processor and for initializing 
the plurality of parameter registers. 

The coprocessor of claim 2, wherein the data reg- 
ister and logic circuitry include memory space for 
storing parameters corresponding to a plurality of 
sinusoids of the sum of sinusoids. 

The coprocessor of any one of claims 1 to 3, further 
comprising: 

an output buffer, coupled to the accumulator 
circuitry, for presenting accumulated digital values. 

The coprocessor of claim 4. wherein the accumula- 
tor circuitry comprises: 



a data sample buffer, having a plurality of en- 
tries, each entry corresponding to a sample 45 
point in the frame, and having addressing cir- 
cuitry for accessing the contents of the one of 
the entries corresponding to the current sample 
point in the frame; and 

an accumulator adder, having a first input cou- 50 
pled to the output of the sine function calculator 
circuitry, having a second input coupled to an 
output of the data sample buffer to receive the 
contents of the accessed entry, and having an 
output coupled to the output buffer and to an 55 
input of the data sample buffer, the adder being 
for adding the digital value at the output of the 
sine function calculator circuitry to the contents 
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pled to the phase register and having a second 
input coupled to an angle table, having a sign 
output coupled to the control inputs of the first 
and second inverting circuits, and having an 
output coupled to the phase register. s 

9. A method of synthesizing a digital signal corre- 
sponding to a sum of a plurality of sinusoidal com- 
ponents, comprising the steps of: 

w 

for a first frame, corresponding to a period of 
time including a first plurality of sample points, 
receiving frame boundary parameters for each 
of a plurality of sinusoidal components; 
for a first sinusoidal component, initializing the 15 
contents of a plurality of parameter registers 
corresponding to a previous sample point in a 
frame, the plurality of parameter registers stor- 
ing an amplitude value, a delta amplitude value, 
a phase value, a difference value, and a delta 20 
difference value; 

for each of a plurality of sample points in the 
frame for the first sinusoidal component, itera- 
tively generating a sinusoidal sample value 
having an amplitude corresponding to a sum of 25 
the amplitude value and the delta amplitude 
value, times the sine of a phase angle corre- 
sponding to a sum of a phase value with a sum 
of the difference value and delta difference val- 
ue; 30 
storing the sinusoidal sample values for the first 
sinusoidal component in a buffer; 
repeating the initialization, generating, and 
storing steps for each of a plurality of sinusoidal 
components, wherein the storing step accumu- 35 
lates the sinusoidal sample values for each of 
the plurality of sample points, over the plurality 
of sinusoidal components; and 
after the initialization, generating, and storing 
steps for the plurality of sinusoidal compo- 40 
nents, outputting the accumulated sinusoidal 
sample values. 



ue; 

updating the difference value in the parameter 
registers with the current difference value; 
adding the current difference value and the 
phase value to produce a current phase value; 
updating the phase value in the parameter reg- 
isters with the current phase value; and 
calculating the sinusoidal sample value from 
the current amplitude and current phase val- 
ues. 

12. The method of claim 11, wherein the calculating 
step comprises: 

loading a first register with the current ampli- 
tude value; 

loading a phase register with the current phase 
value; 

for a preselected plurality of iterations, perform- 
ing a sequence of operations comprising: 

comparing contents of the phase register 
with one of a sequence of angles; 
updating the contents of the phase register 
with a difference resulting from the com- 
paring step; 

cross-adding first and second shifted val- 
ues to the contents of a second register 
and the first register, respectively, the first 
and second shifted values corresponding 
to the contents of the first and second reg- 
isters, respectively, shifted right by a 
number of bits corresponding to the itera- 
tion, and wherein the signs of the cross-ad- 
dition correspond to a sign value generated 
in the comparing step; and 
updating the contents of the first and sec- 
ond registers with the results of the cross- 
adding step; 

wherein the sinusoidal sample value is out- 
put from the second register after the 
preselected number of iterations. 



10. The method of claim 9, further comprising: 

repeating the receiving step, the repeated in- 
itialization, generating, and storing steps, and the 
outputting step, for a plurality of frames. 



13. The method of any one of claims 9 to 12, wherein 
the storing step comprises: 

retrieving the contents of one of a plurality of 
entries of the buffer, the entry corresponding to 
a current sample point; 

adding the sinusoidal sample value for the cur- 
rent sample point to the retrieved contents to 
produce an accumulated sinusoidal sample 
value for the current sample point and 
storing the accumulated sinusoidal sample val- 
ue in the entry of the buffer corresponding to 
the current sample point. 

14. The method of claim 13, further comprising: 



1 1 . The method of claim 9 or claim 1 0, wherein the gen- 
erating step comprises: so 

adding the amplitude value and the delta am- 
plitude value to produce a current amplitude 
value; 

updating the amplitude value in the parameter 55 
registers with the current amplitude value; 
adding the difference value and the delta differ- 
ence value to produce a current difference val- 
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during the generating and storing steps forthe 
first sinusoidal component, blocking the adding of 
the sinusoidal sample value for the current sample 
point to the retrieved contents. 

5 

15. The method of claim 13 or claim 14, wherein the 
outputting step comprises: 

outputting the accumulated sinusoidal sam- 
ple values during the generating step for a last si- 
nusoidal component. *0 
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(57) A coprocessor (15) for synthesizing a signal 
from the sum of sinusoids is disclosed. The coprocessor 
(15) is preferably included within an electronic system 
having a host processor (1 2) that forwards frame bound- 
ary parameters to the coprocessor (15) . Parameter reg- 
isters (26) are provided in the coprocessor (1 5) for storing 
synthesis parameters for iterative ly deriving amplitude 
and phase values for each sample point within a data 
frame. Adders (28, 30, 32) generate the current ampli- 
tude from one addition, and the current phase value from 
two additions, with the results of the additions stored back 
into the parameter registers (26). A sine function calcu- 
lator circuit (34), such as may be implemented by way of 
a CORDIC technique, receives the current amplitude and 
phase values, and generate a digital component signal 
for the current sample point for one of the sinusoids. The 
digital component signal is accumulated with that of other 
sinusoids at the sample point in a data sample buffer 
(40), with the final accumulated digital components pre- 
sented at an output (44) to the host processor (12). 
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